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Abstract 

We study the nonlinear dynamics of a protein-DNA molecular system by treating DNA as a 
set of two coupled linear chains and protein in the form of a single linear chain sliding along the 
DNA at the physiological temperature in a viscous medium. The nonlinear dynamics of the above 
molecular system in general is governed by a perturbed nonlinear Schrodinger equation. In the 
non-viscous limit, the equation reduces to the completely integrable nonlinear Schrodinger (NLS) 
equation which admits N-soliton solutions. The soliton excitations of the DNA bases make localized 
base pair opening and travel along the DNA chain in the form of a bubble. This may represent the 
bubble generated during the transcription process when an RNA-polymerase binds to a promoter 
site in the DNA double helical chain. The perturbed NLS equation is solved using a perturbation 
theory by treating the viscous effect due to surrounding as a weak perturbation and the results 
show that the viscosity of the solvent in the surrounding damps out the amplitude of the soliton. 
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I. INTRODUCTION 



Protein-Deoxyribonucleic acid(DNA) interaction plays an important role in a large num- 
ber of cellular processes such as gene expression, suppression, replication, transcription, 
recombination, repair and in several other processes [1']. DNA participation in the above 
processes are mediated or catalyzed by DNA-binding proteins like polymerases, helicases, 
nucleases, isomerases, ligases and histones. Usually cellular processes start with the binding 
of a protein (enzyme) to the DNA. For example, the transcription process starts with the 
binding of RNA-polymerase (an enzyme protein) with a promoter site of the DNA and this 
binding is known to change the conformation of DNA by opening the bases. The above 
conformation change or base pair opening stresses the relevance of dynamics in understand- 
ing protein-DNA interaction. Experimentally, these conformational changes in DNA during 
protein-DNA interaction have been studied through atomic force microscopy 2|, dynamic 

n n n 

spectroscopy [3| , X-ray crystallography of protein-DNA co-crystal [Jj , NMR studies [5| and 
electrophoresis experiments (6[ as well as through fluorescent measurements j?]. Results 
from free energy calculations combined with molecular dynamics simulations also explain 
the base flipping through protein-DNA interaction j^, 9, 10, 11]. However, from a theoretical 
point of view, the understanding of protein-DNA interaction and its dynamics is still in its 
infancy. This is because the individual dynamics of DNA and protein itself is a very com- 
plex one. Also, interaction of the DNA molecule with the surrounding viscous medium and 
thermal fluctuations add to the complexity. The knowledge of the DNA dynamics through 
soliton-like excitations describe base pair opening or base flipping during DNA functions. 
In this direction, Englander et al 12|, u sed soliton excitations to explain base pair opening 
in DNA for the first time. Yomosa fisl . [l4 | proposed a plane base rotator model by taking 
into account the rotational motion of bases in a plane normal to the helical axis, and then 



Takeno and Homma 
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16| generalized the same and the nonlinear molecular excitations 



were shown to be governed by kink-antikink solitons. In contrast to the above, Peyrard 
and Bishop [17] and Christiansen and his collegues |18|] proposed a model by taking into 
account the transverse and longitudinal motions of bases in DNA to describe the base pair 
ope ning through breather. Later, several authors 19|, |20|, |2l|] including the present authors 
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231 ] suggested that either kink soliton or breather would be a good candidate to play 



a basic role in base pair opening in DNA. In all the above studies, the dynamics of DNA 
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molecular system has been studied without taking into account thermal fluctuations and 
viscosity of the surrounding medium and hence, it was shown that, the soliton represent- 
ing the base pair opening in DNA may travel for infinite distance and time. Recently, few 



authors studied the effect of thermal fluctuations [2j, |25| and viscosity of the surrounding 
medium on unzipping and soliton-like base-pair opening and showed that these effects damp 
the solitons and hence travel only for a limited distance 26|, 27, 28, 0, [s^. Like DNA 
dynamics, the dynamics of protein through soliton excitations also plays an im por tant role 
in describing energy transfer in alpha helix proteins. In this regard, Davydov [3l|, l32j who 
for the flrst time used soliton excitations to explain energy transfer in alpha helical proteins 
by proposing a new model by considering the coupling between the quantum transition oc- 
curring due to the vibrational structure of the C = O bond and the elastic longitudinal 
wave propagation along the chain with the helix behaving like a spring. Later on, the above 



study was extended by several authors 
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36| 



34| including one of the present authors 
to study the effect of exchange excitation between the chains, temperature, higher order 
interactions and interspine coupling. In contrast to Davydov's model, Yomosa 37| and very 
recently Sataric et al 381] studied soliton excitations in protein by considering the molecule 
as a toda lattice chain. The recent results on the statistical mechanics of protein-DNA sys- 
tem thr oug h coarse-grain and worm-like chain models describe unzipping of the DNA chain 



39, 
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42l |. Normally, protein molecule interacts with DNA either through non-specific 



interaction (sequence-independent) which is mainly driven by the electrostatic attractive 
force between positively charged amino acids and negatively charged phosphate groups of 
the DNA back bone or through specific interactions (sequence-dependent) including hydro- 
gen bonds, van der Waals force and water mediated bonds between the protein molecule 



and specific site of DNA 



43| . In a recent paper, Sataric et al [44] studied the impact of 



regulatory proteins through hydrogen bonds on breather excitations in DNA by considering 
Davydov model of amide-I vibration for protein dynamics and Peyrard-Bishop's model of 
stretching of hydrogen bonds for DNA dynamics and found that binding of protein to DNA 
gives rise to a large-amplitude breather. Except the above, no author has so far studied 
the impact of protein on the dynamics of DNA. Hence, in the present paper, we study the 
conformational changes that take place in the form of base pair opening in DNA through 
nonlinear excitations induced by a protein molecule through interaction at the physiological 
temperature in a viscous surrounding medium. The paper is organized as follows. In section 
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II, we present details of the above model and the Hamiltonian for the DNA molecular chain. 
The model Hamiltonian for protein-DNA interaction is given in section III. In section IV, 
we derive the equation of motion for the protein-DNA molecular system in the continuum 
limit and in the next section (section V) base pair opening in DNA as soliton solution of 
the associated nonlinear dynamical equation in a non-viscous medium is shown. The effect 
of viscosity of the surrounding medium on the base pair opening in DNA is understood 
through perturbation analysis in section VI. The results are concluded in section VII. 



II. MODEL HAMILTONIAN FOR DNA DYNAMICS 



We consider a DNA double helix in B-form along with a protein molecule (say RNA- 
polymerase) at physiological temperature in a surrounding viscous medium and study the 
impact of protein interaction on the dynamics of DNA. The model we propose here for 
the study treats DNA as a set of two coupled linear molecular chains and protein as a 
single linear chain interacting through a linear coupling. A schematic representation of 
the above protein-DNA molecular system is shown in Fig. 1(a). In the figure R and R' 
represent the two complementary strands in the DNA double helix. Each arrow represents 
the direction of the base attached to the strands and the dots between arrows represent 
the net hydrogen bonding effect between the complementary bases. The shaded ellipse 
overlapping the DNA double helical structure represents interaction of protein with the DNA 
molecule. The conformation and stability of DNA double helix is mainly determined by the 
stacking of bases through intrastrand dipole-dipole interaction and through hydrogen bonds 
between the complementary bases (interstrand interaction). From a heuristic argument, it 
was assumed that the hydrogen bonding energy between the complementary bases depends 
on the distance between them. Generally, the distance between the complementary bases 
can be expressed through longitudinal, transverse and rotational motion of bases. Among 
the different motions, the rotational motion of bases is found to contribute more towards 
the opening of bases pairs. Hence, for our study, we consider a plane-base rotator model for 



DNA 



13 



Iq which the authors have extensively used for studying pure DNA dynamics in 



the recent times 



22 



23l | . In order to find the distance between the complementary bases, in 
Figs. 1(b) and 1(c), horizontal projections of the n*^ base pair in the xy and xz-planes are 
presented respectively. In the figure, Qn and Q'^ denote the tip of the n*^ bases belonging 
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FIG. 1: (a) A schematic representation of a DNA double helix molecule with a protein molecule 
interacting with it. (b) A horizontal projection of the n*^ base pair of the DNA in the xy-plane. 
(c) A horizontal projection of the n^^ base pair of DNA in the xz-plane. 



and 



'n) 



to the complementary strands R and R' at P„ and respectively. Let 
represent the angles of rotation of the bases in the n^^ base pair in the xz and xy-planes 
respectively. By using simple geometry in Figs. l(b,c), we can write the distance between 



the tips of bases as 



m 



{QnQ'n 



I \2 



2 + + {Zn - z'^f + 2{zn - 4) 
X (cos On — COS 6'^ — 4r [sin On cos 0„ 
+ sin 6'n cos (t)'n\ + 2 [sin On sin O'n (cos 

X cos (p'n + sin (pn sin 0^) — cos On COS 0'^ , 



(1) 



where 'r' is the radius of the circle depicted in Fig. 1(b). 



The hydrogen bonding energy can be understood in a more clear and transparent way 
by introducing quasi-spin operators Sn = (5'^', S^, 5*^) = (sin On cos 0„, sin On sin 0„, cos On) 
and Sn = {S'n -I S'y , S'n) = (sin 6*^ cos 0^, sin 6'^ sin 0^, cos^^^). Using the above, Eq.([l]) can 
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be rewritten as 



{QnQ'nf 



-4r \s: + S': . 



(2) 



While writing Eq.([2]), we have chosen Zn = z'^. It is interesting to note that the form 
of {QnQ'n)'^ given in Eq.(l2]) is the same as the Hamiltonian for a generahzed form of the 
Heisenberg spin model. Therefore, the intrastrand base-base interaction in DNA also can be 
written using the same consideration. Also, it is reasonable to think that if such a quasi-spin 
model can be used in this problem, the double strand DNA and the rung-like base pairs can 
be conceived as a coupled spin chain model or a spin ladder system. 

With the above consideration, we are at liberty to use the following Heisenberg model of 
the Hamiltonian for a coupled spin chain model or spin ladder system with ferromagnetic- 
type exchange interaction between nearest neighbouring spins in the same lattice (equivalent 
to stacking of bases in one strand i.e. intrastrand interaction) and ferromagnetic or antifer- 
romagnetic rung-coupling (equivalent to hydrogen bonds between complementary bases i.e. 
interstrand interaction). 



Thus, the DNA double helical chain is mapped onto a two coupled spin chain model or a spin 
ladder system with ferromagnetic legs (J > 0) and ferromagnetic (/i > 0) or antiferromag- 
netic (/i < 0) rungs. Therefore, in Hamiltonian ([3]), the terms proportional to J correspond 
to stacking interaction between the n*'' base and its nearest neighbours in the two strands 
and the last term which is proportional to fi corresponds to the interstrand interaction or 
hydrogen bond energy between the complementary bases. In equilibrium, the parameter n 
is expected to be less than zero (corresponding to antiferromagnetic coupling). 
As DNA works at the biological temperature, the hydrogen atom attached to the bases are 
also normally in a thermally excited state. Therefore, it is necessary to generalize the above 
model into a thermal DNA. Thus, to include the effect of thermal phonons into the system, 




(3) 



n 
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we add the following Hamiltonian. 



h ki{Xn — Xn+lY 

llTLi 



(4a) 



Hd-t = «i 5Z(^n+i - X„_i)(S„ ■ S;), (4b) 

n 

where p„ = miX„, with overdot representing the time derivative and ki represents the elastic 
constant, mi is the mass of the hydrogen atom attached to the base and X„ represents the 
displacement of it at the n*^ site along the direction of the hydrogen bond. The interaction 
Hamiltonian H^-t given in Eq. fl4bp represents the coupling between the vibration of the 
above hydrogen atom (thermal fluctuation) and the rotation of bases. 

III. MODEL HAMILTONIAN FOR PROTEIN-DNA INTERACTION 

As protein binding to DNA induces large mechanical stress on it, which causes confor- 
mational changes, and thus acts as a precursor for all the functions of DNA. Hence, we 
investigate here the dynamics of a DNA when a protein molecule interacts with it by sliding 
on the DNA chain. Eventhough, proteins are much larger in size than the DNA(substrate), 
only a small portion of the protein molecule, namely the active site is directly interacting 
with the DNA molecule (see Fig.l (a)). Hence, we consider the short active site region of 
the protein molecule that interacts with the DNA as a linear chain as has been treated by 
several other authors (see for e.g. Sataric et al [38|). Thus, as said earher, we propose the 
model for the above protein-DNA molecular system by considering DNA as a set of two 
coupled linear chains and the active site region of the protein molecule as a linear molecular 
chain interacting with the bases which is schematically shown in Fig. 2. The DNA part of 
the sketch is the linearized chains of Fig. 1(a) and n,n ±1 represent the n*^ and {n ± 1)*'* 
sites representing the bases (similar to Pn, Pn, Pn±i, P!n±i Figs. l(b,c) ). G represents 
the linear protein molecular chain (active site region) interacting with the bases by sliding 
on the DNA chains. To explain the model for the above system further, we consider the 
protein molecule as a collection of mass points, with each mass point representing a peptide 
unit and connected by linear springs exhibiting longitudinal stretching motion parallel to 
the helical axis of DNA, i.e. along z-direction which couples to the hydrogen bonds of bases 
in a linear way. Hence, the model Hamiltonians for the longitudinal stretching motion of 
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FIG. 2: A sketch representing the protein-DNA molecular system 



the protein molecule (Hp) and its interaction with the DNA chain {Hu^pj is written as 



Hr, 



Hd-p 



E 



n 
n 



(5a) 
(5b) 



where qn = 'm2yn, and m2 is the mass of the peptide. ?/„ denotes displacement of the 
n*'^ peptide in the protein chain from its equilibrium position and k2 represents the elastic 
constant associated with the small amplitude oscillation of the protein molecule. The in- 
teraction Hamiltonian Ho^p given in Eq. (l5bl) is chosen to represent the change in hydrogen 
bonding energy due to change in the distance between the adjacent peptide units along the 
hydrogen bonding spines of the protein molecule and a2 is the coupling coefficient. Further, 
as the protein molecule is assumed to slide on the DNA chain, the interaction energy along 
z-direction is expected to be dominant over the energy in the xy-plane normal to it (see 
Eq. (l5b1) ). Thus, the total Hamiltonian for our model can be written using Eqs. (j3]), (l4all . 
( l4bll . (l5aD and (l5b]) as 

H = Ho + Hp + Hj:,_p + Ht + Hj:i_t, 

= [-{^(Sn • Sn+1 + • S'„+i) +/i(Sn ' S'„)} 



2mi 2m2 

+ai(X„+i - X„_i)(Sn ■ S'„) + a2{yn+i - yn-i)S^S'^ 



(6) 



Before proceeding further, for the sake of completeness, we present the form of the Hamil- 
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tonian ([6]) in terms of angles of rotation of bases as 

H = [- J{sin 9n sin 9n+l COs{(f)n+l - (t)n) + COS On COS 9n+l 

n 

+ sin e'^ sin cos((/)^^_;^ — 0^) + cos 6'^ cos 

-[/i - - X„_i)]{sin6'„ sin6'^ cos(0„ - 0^) + cos6'„ cos 6'^} 

2 2 

Zrrii 2m2 

+a2{yn+l - Vn-l) COS 9n COS 9'J , (7) 

In the case of Heisenberg spin systems, when the spin value is large, the spin dynamics 
is understood either through a classical approach or under semi-classical approximation 
by bosonizing the Hamiltonian (see for e.g. Ref. ^|). Also, it should be mentioned 
that creation and annihilation operators were used to represent the Hamiltonian while 
studying the transport of charge and hole along short DNA molecules Q, and while 
investigating the nonlinear dynamics of alpha helical protein molecules using the model 
proposed by Davydov [31]. Therefore, along the same lines, in order to understand the 
dynamics of the above protein-DNA molecular system, here also, we bosonize the Hamil- 



tonian ([6]) using Holstein-Primakoff (H-P) representation [48|] for quasi spin operators by 
writing S:^ = [1 — e^a]^a„]^/^ea„, S"^ = V2ea'l[l — e^a|ja„]^/^, S"^ = [1 — e^o-J^^n], where 
= S^± iS^. In the low temperature hmit, aj^a^ << 25, and hence, the H-P transforma- 
tion can be expanded in a power series in terms of the parameter e = 1/ as 

Sn = v^e[l - j4a„ - — 4a„4a„ - 0(e^)]a„, (8a) 
Sn = - j4a„ - — 4a„4a„ - 0(e^)], (8b) 

and similar expansions for S'^, S'~ and S'^^ in terms of hnipl^). Here al^{h\^) and anipn) repre- 
sent creation and annihilation operators of the n^^ bases and satisfy the usual commutation 
relations, [a^, a^] = b^] = 5mn, [am, = [bm, bn] = Wm, a^] = [bl,, bl] = 0. Substituting 
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Eqs. (IHal) and (l8b|) in Eq.Q , we have the Hamiltonian upto O(e^) as 



H 



Pn 



fl2 



2mi 2m2 



+ - Xn+lf + k2{yn - Vn+lf 



+e {- J(a„aJ,^i + a 
-[/i - «i(X„,+i - X„_i)](a„6jj + aih 



'n ^n^n 



-blbn) - a2iyn+i - ?/n-i)(4a„ + blbn)}] . 



(9) 



IV. THE DYNAMICAL EQUATIONS 

Having written down the Hamiltonian in the semi-classical description, the dynamics 
of the protein-DNA molecular system can be understood by constructing the equations of 
motion as 



ih- 



dt 



an, H 



+1) (^n+l I ; 



(10) 



and similar one for 6„. The equations of motion for X„ and ?/„ are written using the 
Hamilton's equations of motion ^ = -^, ^ = ^, ^ = _|gand^ = g-. The 
explicit form of the equations of motion can be derived by substituting Hamiltonian ([9]) in 
the above equations of motion for a„, 6„, X„ and ?/„. Thus, we get 

.dan 



rrii- 



dt 

.dbn 



m2- 



d^Jn 



-J(a„+i - 2an + ttn-l) - [/i + ai{Xn+i 
-Xn-l)]{bn - an) - a2{yn+l - yn-l)an, 

-J{bn+i - 26„ + bn-i)-[fi + 

-X„_i)](a„ - bn) - a2{yn+l - yn^l)bn, 

ki{Xn+i - 2Xn + Xn-i) + ai[a 

~(^n+l(^n+l + bn_ibn-l — 6jj_|_i6„+i 

hiVn+i - 2y„, + yn-i) + a2[al_ian-i 

—al_^_ian+l + bl^_^bn-l — fejj+i^n+i]. 



(11a) 

(lib) 



(11c) 



(lid) 
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While writing the above equations flllatllldll . we have rescaled the time variable and re- 
defined mi, 1712, ki and /c2- In order to represent the large amplitude collective modes by 



coherent states, we introduce Glauber's coherent state representation 49|| for boson opera- 



tors al^\u >= u* |m >, an\u >= Un\u >, \u >= n„|u„ > and bl^\v >= f * |f >, >= > 
, \v >= Iln\vn > with < u\u >= 1 and < v\v >= 1 where m„ and Vn are the coherent 
amplitudes of the operators a„ and 6„ for the system in the states \u > and \v > respec- 
tively. Further, as the length of the DNA and the protein chains are very large compared 
to the lattice parameter, we make a continuum approximation by introducing the new fields 
Un — ^ u{z,t),Vn(t) — > v{z,t), Xn{t) X{z,t) and Unit) y{z,t), where z = nl with the 
expansions m„±i = u{z,t) ± + ± 0{l^) and similar ones for Vn±i, Xn±i and yn±i- 
Under the above approximations, the equations of motion f ilial) , ( lllbl) . f lllcp and ( llldj) after 
rescaling z and redefining ai, 0^2, ki and ^2 upto OiJ?) can be written as 

iut = -Uzz - (yu - aiXz){v -u) -a2 VzU, (12a) 

ivt = -v^z - (/U - aiXz){u -v) -a2 VzV, (12b) 

rriiXtt = kiXzz — OLi \\u\^ + \v\^ — uv* — u*v\z, (12c) 

m2ytt = k2yzz - 0^2 [lul"^ + \v\^]z- (12d) 



In Eqs. fll2atil2d| . the suffices t and z represent partial derivatives with respect to time t and 



the spatial variable z respectively. On subtracting Eq. fll2al) from (]12bl) and by choosing 
V = —u, Eqs. 12 (a-d) can be written as 

iut - {2/i - {a2yz - 2aiXz)}u + u^z = 0, (13a) 

Xu--^Xzz = - — [\u\%, (13b) 
rrii nil 

k2 2^2 |2n „ s 

ytt Vzz = [\u\ \z- (13c) 

m2 ni2 

It may be mentioned that, addition of Eqs. fll2ap and (]12b|) satisfy identically. The term 



proportional to fi in Eq. fll3al) can be transformed away using the transformation u{z,t) = 
u{z,t)e~'^'^^^ and the result reads (after dropping the hat) 

iut + Uzz + {'^a\Xz + a2yz)u = 0, (14) 

The set of coupled equations (113bl c) and (fT^ describe the dynamics of our protein-DNA 
molecular system at the biological temperature, when the protein molecule binds to the 
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DNA double helical chain through linear harmonic coupling. The dynamics is found to 
be governed by the excitation of DNA bases and thermal vibration of the hydrogen atoms 
attached to the bases combined with the longitudinal motion of peptide units of the binding 
protein. In particular, we are concerned with the nonlinear excitation of bases induced by 
protein and thermal fluctuations, in which a cluster of DNA bases may undergo a large 
excursion as compared to the rest of the bases. It may be noted that when a\ = a2 = 0, 
Eqs. f|T^ . fll3bl) and fll3cl) are decoupled and reduced to a set of linear equations. Thus, 
when the protein molecule is detached from the DNA chain [a\ = 02 = 0), the dynamics is 
governed by the following well known set of linear equations. 

iut + = 0, (15a) 

Xtt - —X,, = 0, ytt - —y.z = 0. (15b) 
nil ^2 

While Eq. fll5ap is the time- dependent Schrodinger equation for a free particle, Eqs. fll5bp 
are the homogeneous linear wave equations. Eq. fll5al) admits plane transverse wave solution 
of the form u = MoC**-''^"^*^ with the dispersion relation w = k? where uq is the constant 
amplitude. On the other hand, Eq. (]15bl) admits linear non-dispersive wave solutions 
X = fi{z- vit) + gi{z + vit) and y = f^iz - V2t) + g2{z + V2t) where /i, gi and /a, g2 are 
arbitrary functions and f 1 = \J^^ and f 2 = -^/^j represent the constant phase velocities 
of the wave. When the protein molecule started interacting with the DNA molecular chain 
at the physiological temperature, i.e when ai 7^ and ^2 7^ 0, the excitation energy of 
the protein-DNA molecular system at the physiological temperature (thermal fluctuation) 
increases, and nonlinearity started playing its role. Thus, the set of full coupled nonlinear 
equations become important and it is essential that Eqs. (llSbj c) and (IT^ should be solved 
in their full form to understand the underlying nonlinear dynamics. 



V. SOLITON, BASE PAIR OPENING AND BUBBLE TRANSPORT 

In order to solve the set of coupled equations f|T4|) . fll3bl) and fll3cl) . in their full form 
we differentiate Eqs. (113bp and fll3cp with respect to z once and define X{z^t) = X^ and 
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Y{z,t) = Hz, so that Eqs. (fT^ . (113bp and fllScp are written as 

iut + Uzz + (2aiX + a2Y)u = 0, (16a) 

4q;i 

X« - v^Xzz + —[H%z = 0, (16b) 
mi 

>"«-^2n.+— [iwr].. = o. (16c) 

Now, we rewrite Eqs. (]16bp and fll6cl) by introducing the wave variable ( = z — v^t and 
writing X{z,t) ^ X(C),F(2,t) ^ F(C) . 

Xc^-2/3i[hi|\^ = 0, (17a) 
l/Cc-2/32[|^i|\c = 0, (17b) 



where /3i = and /?2 = On integrating Eqs. f ll7al) and ( ]17b|) with respect 

to C twice and assuming both the integration constants to be zero, we get X = 2(3i\u\^ and 
Y = 2/^2 Iwp, which upon using in Eq. fll6al) gives 

iUt + Uzz + 2\U\^U = 0. (18) 

While writing Eq. (|T8i) . we have made use of the transformation u{z,t) = (2ai/3i + 
a2P2)~^U{z,t). Eq. ( |T8l) is the well known completely integrable nonlinear Schrodinger 
(NLS) equation which has been solved for N-soliton solutions using the inverse scattering 



transform (1ST) method 50|. For instance, the one soliton solution is written as 



U = risech[ri{z - 2^t - e^)\exv\ii{z - 2^t - ^o) + i{iv^ + ^^)t - fXo)], (19) 

where rj,^,6o and ctq are four real parameters which determine the propagating amplitude, 
velocity, initial position and initial phase of the soliton. The solitons in the protein-DNA 
molecular system with thermal fluctuation are formed as a result of the dynamical balance 
between the dispersion due to interaction of intrastrand (stacking) dipole vibrations in each 
strand of the DNA with the nonlinearity provided by the interaction between the hydrogen 
bonds in DNA and the local displacement of the peptide groups in the protein molecule 
and the thermal phonons. The longitudinal waves that arise in the protein molecule and 
hydrogen atoms in DNA in turn provide a potential well that prevents dispersion of the 
rotational energy of the bases in DNA. Thus, the propagation of rotation of bases in DNA 
is coupled to the longitudinal waves of protein and the coupled excitations propagate as a 
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(b) 



FIG. 3: (a) One soliton solution ( Eq. [T9|) of the NLS equation . (b) A schematic representation 
of formation of bubble with the solitons and its propagation along DNA. 



localized and dynamically self-sufficient entity called solitons which travel along each strand 
of the DNA chain. In Fig. 3(a), we have plotted the square of the absolute value of 
the one soliton solution U i.e. |f/p as given in Eq. f|T9|) . In Fig. 3(b), we present a 
schematic representation of the coherent base excitations in DNA in terms of rotation of 
bases induced by the protein molecule in the form of solitons propagating along the two 
strands which collectively form a travelling bubble created by energy delocalization due 
to nonlinear effects. Thus, the soliton solution describes an open state configuration in 
the individual strands of the DNA double helix which collectively represent a bubble. In 
the figure, the shaded ellipse represents the region of interaction of the protein molecule 
with DNA where the bubble is formed. Thus, the protein molecule acts as a zip-runner 
in opening the bases in DNA chain during the process of transcription. Similar results 

n 

have been observed experimentally by Ha et al|51], on the winding and unwinding of E-coli 
Rep helicase-DNA complex. Further, our results on bubble propagation in DNA due to 
protein interaction is in accordance with the experimental data on the binding of RNA- 



polymerase to promoter [5 



53 



54 



55|, 



56l |. From the expression for the soliton namely 
u = (2ai/3i + a2f32)-hsech[r]{z - 2^t - eo)]exp[i^{z - 2^t - Oq) + i{{rj^ + (^)t - (Tq)], it is 
noted that the amphtude of the soliton depends on the coupling of the DNA excitations to 
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the thermal phonons (ai) and to the molecular vibrations of the protein (0:2) which were 
also respossible for nonlinearity in the soliton equation. For large coupling, it is expected 
that the amplitude of the soliton decreases. Recently Campa \24\ also showed through 
simulation studies that, in the case of large thermal coupling, the bubble travels only for a 
short distance with decreasing amplitude. 



VI. EFFECT OF VISCOSITY 



In a more realistic description of the dynamics of protein-DNA system, it is important to 
consider the effect of the surrounding medium or environment. Effectively, the interaction of 
DNA with the surrounding medium reduces to viscous damping effect. It is known that, in 
the case of a protein-DNA system the solvating water acts as a viscous medium that makes 



the nucleotide oscillations to damp out [57|] . The effect of viscous force exerted on the DNA 
chain can be taken into account by adding a term of the form —i'yU to the right-hand side of 
the completely integrable nonlinear Schrodinger equation ( JT9i) which now becomes a damped 



nonlinear Schrodinger equation as given below. 

lUt + U,, + 2\U\''U = -ijU (20) 

As the viscosity of water is temperature dependent, from a simple fluid mechanics argument, 
one can estimate the magnitude of the damping coefficient as very small at the physiolog- 
ical temperature 28|]. Hence, we treat the term proportional to 7 in Eq. (l20l) as a weak 
perturbation. When 7 = Eq. ( l20l) reduces to the completely integrable NLS equation as 
given in Eq. fll8p and the one soliton solution is given in Eq. fll9p which can be rewritten 
for convenience in the form 

U = r]sechr]{9 - 9o) exp[t^{9 - 9o) + z(a - ^o)], (21) 

where f = -2^, f = 1, f = v' + e^nd f = 0. 
We carry out a perturbation analysis 



58| to understand the impact of the viscous force by 
introducing a slow time variable T = 7^ and treat the quantities 77, ^, ^0 and (Tq as functions 
of this time scale and hence the envelope soliton solution fl2T|) is written as 



U = 11(9, T- 7) exp[«e(^ - ^^o) + - (22) 
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Under the above assumption of quasi-stationarity, Eq. (l20l) reads 



(23) 



where 

F{U) = [{e - eo)iT - iOoT - (Tot]U - i[Ut + U]. (24) 

We assume Poincare-type asymptotic expansion for tj as JJ{9,T;'y) = Yl^=i1^^niG,T) 
and further restrict ourselves to calculation of order (7) such that U{9,T;'y) = Uq{9,T) + 
'-fUi{6,T), where Uq = risech.[ri{d — 6q)]. Further, we assume that f/i = 0i + iipi, where 0i 
and ipi are real. On substituting the above, in Eqs. fl2^ and ([23]), we obtain 

= -r/Vi + hee + QpoUi = ReF(f/o), (25a) 
LsV'i = -ri^i + i^iee + 2|f/o| = ImF(?7o), (25b) 



where 



ReF(;7o) = [{e - d^)^T - iO^T - (JoT%, (26a) 
ImF(f/o) = -[f/oT + f/o]. (26b) 



In Eqs. fl25al) and fl25bp . Li and L2 are self-adjoint operators. It may be checked that, the 



solutions of the homogeneous part of Eqs. fl25ap and (125bp are Uqq and f/o and hence we 
have the following secularity conditions. 

/oo 
Uo0ReF{lJo)de = 0, (27a) 
-00 

/oo 
UolmF{Uo)de = 0. (27b) 
'OO 

On evaluating the above integrals after substituting the values of Uog, Uq, ReF(f/o) and 
ImF(f/o) , we obtain C,t = 0, t]t = —21], which can be written in terms of the original time 
variable t after integrating once as 

e = eo, = r^oe-^^*, (28) 



where 770 cind ^0 cire the initial amplitude and velocity of the soliton. The first of Eq. 
says that when the protein-DNA system interacts with the surrounding viscous medium the 
velocity of the soliton remains constant. However, the second of Eq. (1281) says that, the 
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amplitude of the soliton decreases as soliton propagates. In other words, the viscous nature 
of the solvent medium damps out the soliton exponentially and hence the soliton is expected 
to travel only a short distance. 

Now, we construct the perturbed soliton by solving Eqs. fl25al) and fl25bl) . For that, first 
we solve the homogeneous part of Eq. fl25al) . which admits the following two particular 
solutions. 

011 = sechr/(^ - eo)tanhr/(^ - ^o), (29a) 
+ - tanhr7(6' - sinh?7(6' - 

-sechr/(^-^o)]- (29b) 
The general solution can be found out by using the following expression. 

01 = Ci0ii + C2012 - 011 / 0i2ReF(f/o)d^ 

J — oo 

+012 I (pnReF{Uo)d9. (30) 

J — oo 

Here Ci and C2 are arbitrary constants. We construct 0i by substituting the values of 
011, 012 and ReF{Uo) given in Eqs. (I29al) . (]29bp and fl26al) in Eq. (!30|) and after evaluating 
the integrals, we get 
1 



V 



sech?7(6' — 6q) + 



Off 1 

Ci + ^{e-eo) + -{e-eo) 



i^OoT + (Tot)] sechT]{e - Oq) tan\ir]{e -60) + ^ smhr]{9 - 0^) tanhr/(^ - ^o)- (31) 

Zr] 

The last term in Eq. (!3T]) is a secular term that leads to a solution which is unbounded and 
hence, it is removed by choosing the arbitrary constant C2 = 0. Further, by applying the 
boundary conditions 0i|6i=6io =constant = c and (pie\e=eo = 0, we obtain ^{^Oqt + Cqt) = —c 
and Ci = 0. Using the above results in Eq. ( l3Ti) . the general solution 0i is written as 

01 = c[l -{6- 60) tanhr]{e - 0o)]sech?7(^ - ^o)- (32) 

Next, we solve Eq. (]25b|) . the homogeneous part of which admits the following particular 
solutions. 

■011 = sechr7(6' — 6*0), (33a) 

i^i2 = ^[v{0 - eo)sechr]{e - 60) + smhr]{e - 60)]. (33b) 
Zf] 
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Knowing two particular solutions, the general solution of Eq. (125bp can be found from 



i^i = Csz/'ii + C4^i2 - i^n / iJi2lmF{Uo)de 

J — oo 

+i^i2 f iJiilmF{Uo)d9, (34) 

J — oo 

where C3 and C4 are arbitrary constants. We construct the explicit form of tpi by substituting 
the values of 4'ii,ipi2 and ImF(t/o) given in Eqs. fl33al) . (]33bp and (126bp and evaluating the 
integrals. 

^1 = jca + ^(^^ - ^^o) - ^liO - ^^o){|^(^^ - ^^o) - ^^ot} + eoTtfxnhriiO - 9o) 
+eoT{0 - eo)sech.'^r]{e - ^0)]} sech(^ - ^o) 

+^sinhr/(^-^o). (35) 

The above solution for tpi contains secular term that is a term proportional to sinh?7(0 — 6q) 
which can be removed by choosing C4 = 0. Further, we obtain C3 = and 6qt = upon 
using the boundary conditions ipi\e=eo = and 4'w\9=9o = 0. On using the above results in 
Eq. (1351) . the final form of ipi is written as 

i^i = l{e-eo)hechr]{9-eo). (36) 

Using the results given in Eqs.( l32l) and ( l36l) we write down the final form of the first order 
perturbed soliton U = [Uq + 7(0i + iipi)) exp[2^(6' — 6*0) + i(cr — ctq)] (by choosing 7 = 1) as 

U = [r]sech7]{e - Oq) + c[l -{6- 9^) tanhr7(0 - ^o)] 

+^|(^ - ^o)'sech77(e - ^0)] exp[ze(^ - ^0) + ^{cx - ctq)]. (37) 

In Fig. 6, we have plotted the square of the absolute value of the perturbed soliton i.e. 

from Eq. fl371) . From the figure, we observe that the amphtude of the soliton decreases 
as time progresses, because of the damping due to viscosity of the surrounding medium. 
Therefore, when the viscosity is high the soliton is expected to travel only for a short time 
and will stop after that. On the other hand, when the viscosity is low, the soliton will 
travel for some time. Similar results have also been observed by Yakushevich ^!\\ through 
numerical analysis. They showed that when the viscosity is low, the soliton passes more than 
3000 chain links in DNA like a heavy Brownian particle and when the viscosity is large, the 
soliton stops after a few chain links. 
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FIG. 4: Square of absolute value of the perturbed soliton solution (Eq. (|37p ) under viscous 
damping. 



VII. CONCLUSION 



In this paper, we investigated the nonlinear dynamics of a protein-DNA molecular sys- 
tem under thermal fluctuations in a viscous surrounding medium by considering DNA as a 
set of two coupled linear chains and protein as a single linear molecular chain interacting 
through linear coupling. In the no n- viscous limit, the dynamical equation for the system 
is derived from the Hamiltonian through a semiclassical approach using Glauber's coherent 
state method combined with Holstein-Primakoff (H-P) bosonic representation under contin- 
uum approximation. The equation of motion reduces to a set of coupled equations in which 
the equation for DNA dynamics is a nonlinear equation for rotation of bases and inhomo- 
geneous wave equations representing vibration of hydrogen atom in the bases and for the 
protein molecule. In the linear limit, the above equations are decoupled and reduced to time- 
dependent Schrodinger equation for a free particle and one-dimensional homogeneous linear 
wave equations. While the former one admits dispersive plane transverse wave solution the 
later ones give non-dispersive wave solutions. When protein molecule and thermal phonons 
started interacting with the DNA, the coupling introduces nonlinearity into the dynamics of 
bases in DNA, and the set of coupled equations of motion reduce to the completely integrable 
nonlinear Schrodinger equation that admits N-soliton solutions. During interaction the en- 
ergy of the excited DNA molecule increases and the nonlinearity localizes the energy thus 
forming localized solitons. The solitons represent opening of base pairs in both the strands 
which collectively form a bubble travelling along the DNA double helical chain at physiolog- 



19 



ical temperature. Thus, the protein molecule acts as a zip runner that opens the base pairs 
which close when the protein molecule progress along the DNA chain. For, strong coupling, 
the amplitude of the soliton is expected to decrease. On the other hand, the dynamics of the 
system in the viscous medium is governed by a perturbed nonlinear Schrodinger equation. 
The effect of viscosity is understood by carrying out a multiple scale perturbation analysis. 
The results show that while the amplitude of the soliton decreases and the velocity remains 
constant as time progresses. The soliton, damps out quickly in the case of high viscosity and 
moves for some time when the viscosity of the surrounding medium is low. The events that 
happen in the present study may represent the binding of an RNA-polymerase to a promoter 
site in the DNA during the transcription process. Our results have very strong coincidence 



with the experimental data 



51, 



53 



54 



55 



56| that the binding of RNA-polymerase to the 



promoter site in DNA is accompanied by a local distorsion of the DNA bases in the form of 
solitons which can propagate along the DNA double helix. In nature, protein binds to DNA 
in a very specific site like promoter, coding or terminator which has a specific sequence of 
bases and this makes the strands site-dependent or inhomogeneous. Hence, it is important 
to understand the nonlinear dynamics of inhomogeneous DNA with the protein bound to 
specific site of DNA and the study is under progress. 
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